add nvidia distribution #565

Merged
19 commits merged into meta-llama:main on Jan 15, 2025

Conversation

@cdgamarose-nv (Contributor) commented Dec 4, 2024

What does this PR do?

Adds an nvidia template for creating a distribution that uses the inference adapter for NVIDIA NIMs.

Test Plan

Build the Llama Stack distribution for nvidia using the template, with both docker and conda image types (one possible invocation is sketched below), then exercise the client against the running server:
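For context, a build along these lines would go through the llama stack build CLI. The invocation below is a sketch based on the standard template workflow, not a command copied from this PR; the NVIDIA_API_KEY variable and the run config path are assumptions about how the NIM inference adapter is configured.

# Build the nvidia distribution from the template as a docker image
llama stack build --template nvidia --image-type docker

# ...or as a conda environment
llama stack build --template nvidia --image-type conda

# Start the server on port 5000 (the port used in the test output below);
# NVIDIA_API_KEY and the run.yaml path are illustrative assumptions
export NVIDIA_API_KEY=<your-api-key>
llama stack run ./run.yaml --port 5000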

(.venv) local-cdgamarose@a4u8g-0006:~/llama-stack$ llama-stack-client configure --endpoint http://localhost:5000
Done! You can now use the Llama Stack Client CLI with endpoint http://localhost:5000
(.venv) local-cdgamarose@a4u8g-0006:~/llama-stack$ llama-stack-client models list
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━┓
┃ identifier                       ┃ provider_id ┃ provider_resource_id       ┃ metadata ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━┩
│ Llama3.1-8B-Instruct             │ nvidia      │ meta/llama-3.1-8b-instruct │ {}       │
│ meta-llama/Llama-3.2-3B-Instruct │ nvidia      │ meta/llama-3.2-3b-instruct │ {}       │
└──────────────────────────────────┴─────────────┴────────────────────────────┴──────────┘
(.venv) local-cdgamarose@a4u8g-0006:~/llama-stack$ llama-stack-client inference chat-completion --message "hello, write me a 2 sentence poem"
ChatCompletionResponse(
    completion_message=CompletionMessage(
        content='Here is a 2 sentence poem:\n\nThe sun sets slow and paints the sky, \nA gentle hue of pink that makes me sigh.',
        role='assistant',
        stop_reason='end_of_turn',
        tool_calls=[]
    ),
    logprobs=None
)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Ran pre-commit to handle lint / formatting issues.
  • Read the contributor guideline,
    Pull Request section?
  • Updated relevant documentation.
  • Wrote necessary unit or integration tests.

@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Dec 4, 2024
@cdgamarose-nv (Contributor, Author) commented:

@mattf for viz

@cdgamarose-nv marked this pull request as draft on December 4, 2024 at 03:30
@cdgamarose-nv marked this pull request as ready for review on January 10, 2025 at 22:02
@cdgamarose-nv (Contributor, Author) commented:

I used a dev docker image to test this (distribution-nvidia:dev). Do we push the image to Docker Hub, or is there a process to follow?

@ashwinb (Contributor) commented Jan 14, 2025:

@cdgamarose-nv yeah, we are setting up CI to automatically upload docker images when we build them for the supported distributions. Pretty imminent!

@@ -0,0 +1,100 @@
version: '2'
Review comment (Contributor) on this line:
I don't think you need an inline-nvidia distro at all?

@ashwinb (Contributor) left a review:

need to remove inline-nvidia

@yanxi0830 (Contributor) commented:

> I used a dev docker image to test this (distribution-nvidia:dev). Do we push the image to Docker Hub, or is there a process to follow?

@cdgamarose-nv yes! We have just added a GitHub workflow to build docker images and upload them to Docker Hub: https://github.com/meta-llama/llama-stack/actions/workflows/publish-to-docker.yml

@yanxi0830 (Contributor) commented Jan 15, 2025:

Merging this PR in and rebasing on top of it to remove the inline-nvidia template, as synced offline with @ashwinb.

@yanxi0830 merged commit b3202bc into meta-llama:main on Jan 15, 2025 (2 checks passed)
@yanxi0830 (Contributor) commented:

Removed inline-nvidia template in 27e07b4
